Prometheus 2 is a language model based on Mistral-Instruct that specializes in fine-grained evaluation of language model outputs. It is intended as an alternative to GPT-4 evaluation and can also serve as a reward model for Reinforcement Learning from Human Feedback (RLHF).
Large Language Model
Transformers, English